# Long Text Support
Ruri V3 70m
Apache-2.0
Ruri v3 is a Japanese general-purpose text embedding model based on ModernBERT-Ja, supporting sequences up to 8192 tokens long and achieving state-of-the-art performance in Japanese text embedding tasks.
Text Embedding Japanese
R
cl-nagoya
865
1
Embedder Collection
Multilingual embedding model for German and English, supporting a context length of 8192 tokens
Text Embedding Supports Multiple Languages
E
kalle07
6,623
10
Khmer Mt5 Summarization 1024tk V2
Apache-2.0
An improved Khmer text summarization model based on mT5-small, supporting inputs of up to 1024 tokens, suitable for summarizing Khmer articles, paragraphs, or documents.
Text Generation
Transformers Other

K
songhieng
16
1
Inf Retriever V1 1.5b
Apache-2.0
INF-Retriever-v1-1.5B is a dense retrieval model based on large language models developed by INF TECH, optimized and fine-tuned for Chinese-English data retrieval tasks.
Text Embedding
Transformers Supports Multiple Languages

I
infly
19.59k
25
Snowflake Arctic Embed L V2.0 Gguf
Snowflake Arctic-embed-l-v2.0 is the latest embedding model released by Snowflake, specifically designed for multilingual workloads, optimizing retrieval performance and inference efficiency.
Text Embedding Supports Multiple Languages
S
Casual-Autopsy
4,066
8
Snowflake Arctic Embed L V2.0 GGUF
Apache-2.0
The GGUF quantized version of Snowflake Arctic Embed L v2.0 is an efficient multilingual text embedding model, suitable for high-quality retrieval tasks.
Text Embedding
S
limcheekin
129
1
Ruri Large V2
Apache-2.0
Ruri is a Japanese universal text embedding model, focusing on sentence similarity calculation and feature extraction, with support for long text processing.
Text Embedding Japanese
R
cl-nagoya
3,672
9
Ruri Large
Apache-2.0
Ruri-Large is a high-performance embedding model specialized in Japanese text similarity calculation, based on transformer architecture with support for long text processing (maximum length 8192).
Text Embedding
Safetensors Japanese
R
cl-nagoya
6,784
41
Ruri Small
Apache-2.0
Ruri is a model specialized in Japanese text embedding, capable of efficiently calculating sentence similarity and extracting text features.
Text Embedding Japanese
R
cl-nagoya
11.75k
9
Ruri Base
Apache-2.0
Ruri is a universal text embedding model for Japanese, focusing on sentence similarity and feature extraction tasks.
Text Embedding Japanese
R
cl-nagoya
523.56k
9
Gte Multilingual Reranker Base
Apache-2.0
The first multilingual reranking model in the GTE series, supporting 70+ languages with high performance and long text processing capabilities.
Text Embedding
Transformers Supports Multiple Languages

G
Alibaba-NLP
239.91k
122
Bge M3 Onnx O4
MIT
This is the ONNX quantized version of the BAAI/bge-m3 model, supporting three functionalities: dense retrieval, multi-vector retrieval, and sparse retrieval, covering over 100 languages.
Text Embedding
Transformers

B
hooman650
285.96k
10
Chinese Llama 2 7b
Apache-2.0
Chinese-LLaMA-2-7B is an extended Chinese version of Meta's Llama-2 model, optimized with an expanded Chinese vocabulary and incremental pre-training to enhance Chinese comprehension.
Large Language Model
Transformers Supports Multiple Languages

C
hfl
1,824
102
Mlong T5 Large Sumstew
Apache-2.0
This is a multilingual, long-text (supports up to 16k input tokens) abstractive summarization model. Trained on the sumstew dataset, it can generate titles and summaries for given input documents.
Text Generation
Transformers Supports Multiple Languages

M
Joemgu
103
9
Pegasus Aeslc
PEGASUS is a pretrained model based on gap sentence extraction, specifically designed for abstractive text summarization tasks.
Text Generation
Transformers English

P
google
21
0
Featured Recommended AI Models